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Abstract 

We are developing a laboratoty-scale model to improve our understanding and capacity to assess the biological risks of 
genetically engineered bacteria and their genetic elements in the natural environment. Our hypothetical scenario concerns 
an industrial bioreactor failure resulting in the introduction of genetically engineered bacteria to a downstream municipal 
wastewater treatment plant (MWWTP). As the first step towards developing a model for this scenario, we sampled microbial 
communities from the aeration basin of a MWWTP at three seasonal time points. Having established a baseline for 
community composition, we investigated how the community changed when propagated in the laboratory, including cell 
culture media conditions that could provide selective pressure in future studies. Specifically, using PhyloChip 16S-rRNA- 
gene targeting microarrays, we compared the compositions of sampled communities to those of inocula propagated in the 
laboratory in simulated wastewater conditionally amended with various carbon sources (glucose, chloroacetate, D- 
threonine) or the ionic liquid l-ethyl-3-methylimidazolium chloride {[C2mim]CI). Proteobacteria, Bacteroidetes, and 
Actinobacteria were predominant in both aeration basin and laboratory-cultured communities. Laboratory-cultured 
communities were enriched in y-Proteobacterla. Enterobacteriaceae, and Aeromonadaceae were enriched by glucose, 
Pseudomonadaceae by chloroacetate and D-threonine, and Burkholderiacea by high (50 mM) concentrations of 
chloroacetate. IVlicrobial communities cultured with chloroacetate and D-threonine were more similar to sampled field 
communities than those cultured with glucose or [C2mim]CI. Although observed relative richness in operational taxonomic 
units (OTUs) was lower for laboratory cultures than for field communities, both flask and reactor systems supported 
phylogenetically diverse communities. These results importantly provide a foundation for laboratory models of industrial 
bioreactor failure scenarios. 
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introduction 

In recent years, there have been increasing numbers of 
genetically engineered microorganisms (GEMs) constructed for 
biotechnology, medicine, bioenergy, bio-based chemicals and 
materials, and bioremediation applications [1-4]. While contain- 
ment strategies including engineered auxotrophy, induced lethal- 
ity, and gene-flow barriers have been developed to mitigate the 
environmental risks posed by GEMs, naturally occurring processes 
may negate these genetic safeguards [5]. Currendy, we cannot 
confidently predict how well a given GEM wiU survive and adapt if 
released into a given environment, nor how it will impact 



ecological biodiversity or interact with indigenous microbes {e.g., 
horizontal gene transfer and subsequent phenotype alteration 
[6,7]). 

A potential scenario to consider is an industrial bioreactor 
failure that results in the exposure of GEMs to the microbial 
communities indigenous to a downstream municipal wastewater 
treatment system. 

Such a wastewater treatment system typically performs primary 
and secondary biological treatment (activated sludge) of municipal 
wastewater before release, and possibly tertiary treatment (micro- 
filtration) for applications including landscape and agricultural 
irrigation, industrial processes, and wetlands restoration. Domestic 
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sewage and other wastewater received by the treatment plant first 
undergo primary sedimentation. Effluent from the primary 
sedimentation basins flows to aeration basins where divers 
microorganisms (originating from the received wastewater) con- 
sume the soluble organic matter. Effluent from the aeration basins 
flows to clarifiers, which sediment the microorganisms from the 
water. The clarifier effluent is either discharged or further 
processed for re-use. The clarifier sediment, or "activated sludge", 
is recycled back to the aeration basins to maintain a high cell 
density of microorganisms and removal rates of soluble organic 
matter, while th(' net growth of microorganisms is sent to 
anaerobic digesters, which also receive primary sedimentation 
sludge. The material of the aeration basin is referred to as "mixed 
liquor", reflecting the mixture of the recycled activated sludge and 
the inflow to the aeration basin. The aeration basin contains a 
complex and high-density microbial community, which could 
serve as a hot spot for horizontal gene transfer. 

While a given laboratory GEM itself may be unlikely to survive 
in an aeration basin, the portion, if any, of the GEM's genetic 
material that confers advantage to others in the microbial 
community (post horizontal gene transfer) may have a greater 
chance for survival. Our long-term primary questions of interest 
(only partially resolved by the ^\-ork reported herein), then, are 
which selectable capabilities found within GEM genetic material 
might provide sufficient advantage to be effectively transferred to 
other members of the community, how the composition of the 
community would be affected by gene transfer, and how is this 
transfer limited by environmental constraints. 

Efforts to characterize microbial structure and gene expression 
within a Hong Kong MWWTP activated sludge community [8], 
to investigate metagenomic determinants of reduced antimicrobial 
drug susceptibility within MWWTP bacteria [9,10], and to 
propagate MWWTP microbial communities in replicate mem- 
brane bioreactors [1 1] in the presence of the model toxin 3- 
chloroaniline [12], have all been recently reported. In order to 
further advance our understanding of potential gene transfer from 
GEMs to a diverse microbial community representative of a 
municipal wastewater treatment plant (MWWTP), it is important 
to establish suitable selection conditions (media and genetic 
markers) for the gene transfer, and to characterize the baseline 
microbial communities resulting from each condition. To this end, 
we inventoried and compared aeration-basin bacterial communi- 
ties and laboratory-grown communities, which were cultured with 
various carbon sources using preserved and maintained mixed- 
liquor communities as inoculum. For the laboratory model system, 
we selected carbon sources as selective media components that 
promote the activity of carbon catabolic genes. These selective 
markers, which could be overexpressed in GEMs in future 
experiments were hypothesized to be rare in MWWTP microbial 
communities. For example, we selected the ionic liquid l-ethyl-3- 
methylimidazolium chloride ([C2mim]Cl), a microbial growth 
inhibitor and deconstruction agent for lignocellulosic feedstocks for 
biofuels-producing GEMs [13], as a selective medium that 
promotes the activity of rare efflux-pump tolerance genes. An 
efflux pump for [C2mim]Cl was recendy identified in Enterobacter 
lignolyticus SCFl and cross-validated in E. coli [14], which is 
evidence that horizontal gene transfer could confer the 
[C2mim]Cl tolerance phenotype. We also screened 192 carbon 
sources employing the phenotype microarray, OmniLog PM 
MicroPlate (BioLog, Hayward, CA) and identified carbon sources, 
including D-threonine, that neither MWWTP microbial commu- 
nities nor E. coli (a potential parental lineage for GEMs) appeared 
to metabolize weU, showing very poor growth compared to other 
carbon sources (data not shown). Note that the selected carbon 



sources, including D-threonine, chloroacetate and [C2mim]Cl, 
were not chosen because they are likely be found in (or especially 
relevant to) MWWTPs, but rather because they could potentiaUy 
serve well as selective media components for future laboratory 
experiments. We hypothesize that the selected carbon sources will 
prove more effective than antimicrobial drugs as selective agents 
for future laboratory experiments since resistance genes for various 
antimicrobial drugs already appear to be prevalent in activated 
sludge microbial communities [9,10]. 

In this study, we have used PhyloChip G3, a third generation 
high-density phylogenetic DNA microarray that has been 
successfufly employed in microbial ecology [15-18] to investigate 
microbial community structure. 

Materials and Methods 

Overall experimental design 

An overview of the experiments conducted herein is schemat- 
ically depicted in Fig. 1. 

Aeration basin samples 

Samples of mixed liquor from the secondary treatment unit of a 
local San Francisco Bay Area wastewater treatment plant with 
typical secondary processes that discharges into San Francisco Bay 
were obtained from the channel that flows from the aeration 
basins to the clarifiers. (John R. Cloak, who is authorized to give 
aU required permission and should be contacted for future 
permission, facilitated sample acquisition on private land and 
provided plant operation data. Sample acquisition did not involve 
endangered or protected species.) On three sampling dates (July 
20, 2011. October 19, 2011, and April 2,5, 2012), four sterile 1-L 
bottles were each fiUed with 500 mL of sample and transported at 
ambient temperature to the laboratory for cryo-preservation as 
inoculum for subsequent culturing and PhyloChip characteriza- 
tion (described below). Inoculum was preserved by adding 1.5 mL 
of sterile 45% (w/v) glycerol solution to 3 mL of sample, and then 
storing it at — 80°C. Larger 75 mL aliquots (50 mL sample with 
25 mL 45% (w/v) glycerol solution for bioreactor inoculation) 
were also preserved from the April 25, 2012 samples. Glycerol 
stock aliquots were prepared under constant stirring to mitigate 
sludge microbe flocculation and maintain homogenous samples. 

Laboratory cultures 

Batch cultures were grown at 26°C in baffled Erlenmeyer flasks 
(50 mL culture in 250-mL flask shaken at 250 rpm on an orbital 
shaker) and bioreactors (500 mL custom-manufactured vessels 
used with controllers manufactured by Fairmentec GmbH, 
Gottingen, Germany, Bioreactor RKOl-40). Each liter of the 
defined basal medium contained 0.87 g K2HPO4, 0.88 g 
KH2PO4, 5.5 mg CaCla, 0.11 g NH4CI, 31 mg CH4N2O, 0.2 g 
MgS04.7H20, 2.4 mg CfiHsFeO,, 25 mg H3BO3, 2.9 mg 
ZnS04.7H20, 0.6 mg KI, 2.5 mg CuS04.5H20, 8.4 mg Co- 
S04.7H20, 0.8 mg Na2Mo04'2H20, 16 mg MnCl2'4H20, and 
63 mg N(CH2COONa)3 (nitrilotriacetic acid trisodium salt). The 
bioreactors were aerated with sterile filtered (0.2 \im) air through 
glass frits, pH controlled to 6.8 with 4% HCl and 20 g/L Na2C03 
and agitated by means of a magnetic stir bar at 400 rpm. 
Dissolved oxygen monitoring of the bioreactors indicated that 
oxygen levels remained at saturation throughout growth. The 
reactors and flasks were inoculated to an initial ODsoo nm of 0.05, 
using the preserved April 25, 2012 samples, which were washed 
three times with the basal medium to remove residual glycerol 
before inoculation. 
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Figure 1. Overview of Experiments. All substrates were tested in triplicate, except for 'No-Carbon in bioreactors', which was tested in duplicate. 
The No-Carbon and [C2mim]CI triplicates in flasks were pooled in order to provide adequate biomass for PhyloChip analysis. 
doi:1 0.1 371 /journal.pone.01 05689.g001 



Carbon sources were added to the basal medium to provide an 
equivalent carbon concentration of 0.26 g C/L, to approximate 
the range of carbonaceous biochemical oxygen demand values of 
the primary sedimentation effluent at the wastewater treatment 
plant. Glucose was selected as a carbon source that could 
potentially be metabolized by a broad range of microorganisms. 



D-threonine (5.5 mM), chloroacetate (1 1 mM), as well as the ionic 
liquid [C2mim]Cl (270 mM) were used as candidates for selective 
carbon sources. Chloroacetate was also tested at higher concen- 
trations (23 mM and 50 mM) that would potentially be growth 
inhibitory to the aeration-basin microorganisms. As controls, 
cultures were grown in basal medium without any additional 
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carbon source (no-carbon). Negative controls in glucose-amended 
basal medium were also performed in the flasks. 

Growth of the batch cultures was monitored by taking samples 
for ODgoo and substrate concentration measurement. In the 
reactors, ODsoo was continuously monitored by pumping a 
recycle line through a cuvette in a small spectrophotometer 
(USB400, Ocean Optics, FL). Samples for PhyloChip were taken 
once cultures reached stationary phase, or immediately following 
the peak OD^oo nm- All conditions were tested in triplicate, with 
the following exceptions: one negative-control was run per batch 
of flask tests, and two no-carbon reactor tests were performed. 

DNA extraction and PCR for microarray analyses 

Microbial community genomic DNA was extracted from three 
biological replicates per sample whenever available. For each 
aeration-basin sample, a 4.5 mL stock vial containing 3 mL 
sample and 1.5 mL of sterile 45% (w/v) glycerol was considered a 
biological replicate and gDNA was extracted from three stock vials 
per sample-date. For laboratory-cultured samples, three flask-, or 
three reactor-cultures per each treatment were used for gDNA 
extraction. When the microbial cell mass was very low 
([C2mim]Cl, and no-carbon flask control cultures), the three 
biological rephcates were pooled to obtain sufficient amount of 
gDNA. 

Biomass for gDNA extraction was concentrated either by 
centrifugation for the aeration-basin samples, or by filtration 
through Sterivex GP filter units with 0.22 Jim pore size (Millipore; 
BiUerica, MA) for the laboratory cultures. Sample was pushed 
through the filter with a syringe until no more filtrate would pass. 
The amount of samples filtered ranged from 5 to 60 mL, 
depending upon the cell density of the sample. The filters were 
asepticaUy removed from the filter units and placed into separate 
sterile 1.5 mL microcentrifuge tubes and stored overnight at 4°C 
in KNAlater solution (Ambion; Austin, TX). On the following day, 
the RNAlater solution was removed and the filters were stored at 
— 80°C until further processing. 

A DNA-EZ Kit (GeneRite; North Brunswick, NJ) was used for 
gDNA extraction following the manufacturer's instruction en- 
hanced by bead beating. Briefly, filters were cut asepticaUy and 
each filter piece, or one third of the pellet, was placed in Lysing 
Matrix E tubes (MP Biomedicals; Solon, OH) with 500 |xL of the 
kit's lysing buffer and treated in a FastPrep bead beater (Savant; 
Carlsbad, CA) at 5.0 m/ s for 45 s twice. The lysed solution was 
applied to a DNA binding column. After several washes, gDNA 
was eluted with 100 |J,L of elution buffer. The eluted sub-samples 
were pooled and the gDNA quantified using a Qubit dsDNA HS 
Assay Kit (Invitrogen, Grand Island, NY). 

Bacterial 16S rRNA genes were amplified with primers 27F (5'- 
AGAGTTTGATCCTGGCTCAG-3') and 1492R (5'- 
GGTTACCTTGTTACGACTT-3'). Each PCR reaction con- 
tained 1 xEx Taq buffer (Takara Bio Inc., Japan), 0.025 units/|j,L 
ExTaq polymerase, 200 mM dNTP mixture, 1.0 |J.g/nL BSA, 
200 nM each primer, and 1 ng genomic DNA. The temperature 
gradient PCR conditions were 95°C (3 min), followed by 25 cycles 
at 95°C (30 s), 48-58°C (30 s), and 72°C (2 min), and a final 
extension at 72°C (10 min). Each sample was amplified in 8 
replicate 25-|J,L reactions (one tube per temperature). Amplicons 
from the gradient amplification were pooled, purified with the 
QIAquick PCR purification kit (Qiagen; Valencia, CA), and then 
quantitated using gel electrophoresis (2% agarose E-gel; Invitro- 
gen). 



Microarray processing and data treatment 

PhyloChip assay was conducted as previously reported [19] with 
the following modifications: (1) TDT enzyme (P/N: PM1875; 
Promega; Madison, WI) and GeneChip Labeling Reagent (P/N: 
90042; Affymetrix; Santa Clara, CA) were used for the labeling 
reaction, and (2) 2.2 nL of 3 nM Control Ohgo B2 (Afiymetrix) 
was used for the hybridization reaction. In brief, 500 ng of purified 
PCR product was fragmented to 50-200 bp, labeled with biotin, 
hybridized onto the microarray, stained and waslied according to 
the manufacturers instructions. Fluorescent images were scanned 
with Afiymetrix GeneChip Scanner 3000 7G. Probe intensities 
were background-subtracted and scaled to the quantitative 
standards. The hybridization score for an OTU was calculated 
as the mean intensity of the perfectly matching probes excluding 
the maximum and minimum intensity values. Technical replica- 
tion of the PhyloChip was not analyzed in this study, since high 
reproducibility of the PhyloChip has been reported [19]. 

On the PhyloChip, the average number of probe pairs assigned 
to each OTU was 37 (s.d. 9.6) [19]. Probe pairs comprised one 
perfectly matching (PM) 25-mer probe and one mismatching 
(MM) 25-mer probe, containing a substitution at the central base. 
The process for scoring an OTU as present/ absent occurs in two 
stages of analysis. Stage 1 of the analysis had the following 
requirements. (1) There was a minimum of seven probe pairs 
scored. (2) At least seven probe pairs that were designed for a given 
OTU were scored positive. Probe pairs were positive if the PM 
probes had higher intensities than the corresponding MM probes. 
(3) Ranked response scores (r scores) of probe pairs in Quartile_l 
were >0.8, Quartile_2>0.93, and Quartile_3>0.98. The r score 
was calculated based on the A and T count in the probe 
sequences, and it measures the potential that a probe pair was 
responding to a target and not the background (see the reference 
[19]). (4) The OTU probe set had a positive fraction >0.92 
(positive fraction is the fraction of probe pairs that scored positive 
in a given OTU). An OTU passing all these criteria passed for 
consideration at Stage 2. For Stage 2 of the analysis, the cross- 
hybridization potential of the remaining OTU was determined. 
Cross-hybridization adjusted response scores (rx) were calculated 
in order to account for potential false positive OTUs, which can 
occur when a sequence in a sample matches some probes used in 
multiple, closely related OTU (for the equations, see the reference 
[19]). The cutoffs for adjusted rx scores for calling species present 
was: Q_l>0.22, Q_2>0.40, and Q_3>0.42. The presence/ 
absence (binary) data of species passing these criteria, the OTU 
binary data (for OTU belonging to the species called present as 
long as they passed Stage 1 criteria), and the fluorescence 
intensities for those OTUs were used for analyzing community 
structures. An OTU was considered present if it was called present 
in at least two of the three sample replicates. Taxa at higher levels 
(phylum, class, order, family, and genus) were considered present if 
at least one species was called present in a given lineage. 

Community comparison using hierarcliical clustering, 
MRPP and indicator analyses of the relative abundance of 
OTUs 

To investigate similarities between the samples, hierarchical 
clustering analysis was performed based on the relative abundance 
of OTUs using PC-ORD (v.6.12; MjM Software, Gleneden 
Beach, OR). In the analysis, distance was measured by Bray-Curtis 
and group average was used as a group linkage method. All 
relative intensity data across the entire sample set was included in 
the analysis as long as an OTU was called as present in at least one 
sample. Raw microarray intensity data were imported into PC- 
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ORD, standardized by general relativization by columns (sam- 
ples), and subsequendy transformed by arcsine square root to 
convert fluorescence intensity data with an arbitrary unit to the 
numbers that are closer approximations of amplicon amounts used 
for microarray hybridization. 

Multi Response Permutation Procedure (MRPP) in PC-ORD 
was used to examine the differences among particular a priori 
groups, which were resolved at a particular level of cluster by 
hierarchical clustering analysis employing Bray-Curtis distance 
measurement. MRPP is non-parametric procedure, which does 
not require distributional assumptions, so it is suitable for 
analyzing ecological community data. MRPP tests the nuU 
hypothesis of no difierence between within-group distances and 
across-group distances [20-22]. It calculates which is a weighted 
pairwise distance within each group and determines the proba- 
bility (j)) value of <5 being equal, or smaller. It also generates the A 
statistic value, which indicates chance corrected within-group 
agreement [^4 = 1- (observed d/ expected d)\. If all items are 
identical within groups [5 = 0), then y4 = 1 , if heterogeneity within 
groups equals expectation by chance, then A = 0, and if 
heterogeneity within groups higher than expectation by chance, 
then A<0. As A value approaches to 1, pre-defined groups are 
more similar each other. 

To determine which OTUs were responsible for the differences 
among the seasonal field samples, or among the laboratory-grown 
communities cultured with various carbon sources, indicator 
species analysis was conducted following the method by Dufrene 
and Legendre [23] using PC-ORD. The analysis calculates 
indicator values of each species (or, OTU) for a particular group 
by considering both the abundance of the species in a specific 
group and the faithfulness of occurrence of the species in members 
of the group. The equation for the indicator value of species i in 
groupj is as follows: iNDVAi^j = AyXByXlOO. Ay is a measure of 
specificity and is calculated by dividing the mean number of 
species i across the members of group j by the sum of the mean 
numbers of species i over all groups. By is a measure of fidelity and 
is obtained by dividing the number of members in group j, where 
species i is present, by the total number of members in that group. 
By is maximized when species i is present in all members of group 
j. When we compared the MWWTP seasonal field communities in 
this study, "group" corresponds to community (for example, the 
April 25, 2012 sample) and "members" corresponds to the three 
replicates of that community. The statistical significance of 
indicator values was tested by Monte Carlo randomization with 
1,000 permutations. The indicator analysis was performed at a 
particular cluster level resolved by hierarchical clustering analysis. 
The relative abundances of indicator OTUs were visualized by R 
(version. 2.15.1; Institute for Statistics and Mathematics, 
Wirtschaftsuniversitat Wien [http://www.r-project.org]) [24]. 
For the R heat-plot, microarray intensities of individual OTUs 
were centered and scaled by subtracting the mean of the row 
(OTUs) across all samples and then dividing the resulting values by 
the standard deviation of the row. The Z scores in heat-plots 
generated by R represent the distances between the scaled 
microarray intensities of individual OTUs and the mean of the 
scaled OTUs. Note that the heat-maps generated by this method 
are applicable only for comparing OTUs across samples, but not 
for comparing among OTUs within samples. Average linkage 
methods were used for hierarchical clustering by R. 



Examination of phylogenetic dissimilarities among the 
field-sampled and laboratory-grown communities using 
Principal Coordinate Analysis (PCoA) 

A representative 16S rDNA sequence for each OTU was 
obtained from the GreenGenes database and ahgned by the 
NAST ahgner (Lawrence Berkeley National Laboratory [http:// 
greengenes.lbl.gov/ cgi-bin/ nph-export_records.cgi]) [25,26] . The 
filtering criteria for the sequences were: a minimum base number 
of 1,250 and a maximum non-ACGT character count in the 
sequences of 50. The alignment comprised the sequences of 7,799 
OTUs, which were detected in at least one sample. Using 
FastTree (v. 2. 1.7; Lawrence Berkeley National Laboratory 
[http://www.microbesonline.org/fasttree]) [27], a phylogenetic 
tree was constructed from the multiple sequence alignment 
employing the generalized time reversible (GTR) models of 
nucleotide evolution for distance measurements. FastTree infers 
approximately-maximum-likelihood phylogenetic trees. FastTree 
uses a heuristic variant of neighbor joining to get a rough tree 
topology first, corrects branch lengths using a mix of nearest- 
neighbor interchanges and subtree- prune-regraft moves, and then 
improves topology and branch lengths by maximum likelihood 
rearrangements. Principal Coordinate Analysis (PCoA) was 
performed using Fast UniFrac (Department of Chemistry and 
Biochemistry, University of Colorado at Boulder [http://unifrac. 
colorado.edu]) [28] and the PCoA result was plotted using 
SigmaPlot (v. 10; Systat Software Inc., San Jose, CA). A UniFrac 
distance matrix was calculated from the tree considering within 
sample normahzed abundance weights. In previous PhyloChip 
validation experiments, the concentrations of PCR amplicons were 
shown to correlate with the log2 values of the fluorescence 
intensities with the correlation coefficient (r) of 0.917 [29]; 
therefore, we used the log2 values of the fluorescence intensity 
scores as the abundance weight. 

Construction of a phylogenetic tree and visualization of 
the microbial composition of the April 25, 2012 
field-sampled community and the laboratory-cultured 
communities 

A single OTU per family was chosen from the GreenGenes 

database to represent each family in a phylogenetic tree (Table 
SI). Non-chimeric 16S rDNA sequences of 120 OTUs were 
obtained from the Hugenholtz 7,682 character data set at 
GreenGenes and aligned by NAST ahgner of GreenGenes as 
described above. An approximate-maximum-likelihood phyloge- 
netic tree was constructed from the multiple sequence alignment 
using FastTree as described above and the tree- was \ isualized by 
iTOL (Interactive Tree Of Life) [30]. An archaeal sequence 
(DQ_300318.1) was included in the tree for rooting. 

Accession reference for PhyloChip data 

AH PhyloChip data are available for download from the Green- 
Genes database [26] [http://gTeengenes.lbl.gov/Download/ 
Microarray_Data/MIAME_data_files_Wastewater-reactors_FIiIlson. 
zip]. AH PhyloChip data were deposited in NCBI's Gene Expression 
Omnibus [31] and are accessible through GEO Series accession 
number GSE58251 [http://www.ncbi.nlni.nili.gov/geo/query/acc. 
cgi?acc = GSE58251]. 
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Results 

IVlicrobial communities detected in the aeration-basin 
samples 

A species was considered present if it was called present in at 
least two of the three sample rephcates. A phylum was considered 
present if at least one species was called present in a given phylum 
(see the Materials and methods for details). The July 20, 2011 
sample was the most diverse community consisting of 24 phyla, 
234 species, and 2,200 OTUs (Fig. 2). The October 19, 2011 
sample contained 17 phyla, 192 species, 1,704 OTUs, and the 
April 25, 2012 sample had 16 phyla, 182 species and 1,886 OTUs. 
The numbers of the OTUs at phylum level (or, class level for the 
major phyla) detected in the seasonal samples are listed in the 
Table S2. The majority (96%-98%) of OTUs in each sample 
belong to four phyla: Proteohacteria, Bacteroidetes, Firmicutes, and 
Actinobacteria. While probes that target these four phyla are 
overrepresented on the PhyloChip G3, other phyla similarly 
overrepresented on the PhyloChip G3 were not detected at 
comparable levels in the aeration-basin samples (Table S2). 

Common OTUs among the seasonal aeration-basin 
bacterial communities 

Although the major phyla found in the seasonal samples were 
very similar, the samples shared a relatively low number of OTUs. 
A total of 1,027 OTUs (35% of the total OTUs counted, 33% of 
total species, 36% of total genus, 41% of total family, 44% of total 
order, 48% of total class and 52% of total phylum) were 
commonly detected in all field samples (Fig. 2). The common 
OTUs consisted of the phyla Proteohacteria (59%), Bacteroidetes 
(22%), Firmicutes, (11%), Actinobacteria (7%), and others (2%; 
Planctomycetes, Gemmatimonadetes , Chloroflexi, Verrucomicrobia, 



Synergistetes, TM7, Acidohacteria, Tenericutes, GN04, and 
OPll). 

The common Proteohacteria consisted of the classes 
Proteohacteria (61%), y-Proteohacteria (28%), OL-Proteobacteria 
(10%), and 5-Proteobacteria (0.7%). In the Bacteroidetes phylum, 
the major classes were Flavobacteria, Sphingobacteria, and 
Bacteroidia. In the Firmicutes phylum, 67% was of the class 
Clostridia, and the rest was Bacilli. AU of the common 
Actinobacteria OTUs belong to the class Actinobacteria. 

OTU variations by aeration-basin seasonal sampling 
dates 

The bacterial communities collected at various sampling dates 
were distinguished from each other at hierarchical cluster level 3 
based on the relative abundance of OTUs (Fig. SI). The 
differences among the bacterial communities by sampling dates 
based on the relative abundance of OTUs were moderate (MRPP: 
^ = 0.47, /) = 0.0004). 

Indicator analysis was used to identify OTUs whose relative 
abundances significantly differed between sampling dates. The top 
200 indicator OTUs (INDVAL>40.5, p<0.05) are shown in Fig. 
S2. The indicator OTUs consisted of both major and minor 
classes. Among the 200 indicators, 98 OTUs were the indicators 
for the July 20, 20 1 1 community, which belong to the classes 
Actinobacteria, a-Proteobacteria, y-Proteohacteria, Clostridia, 
Sphingobacteria, Planctomycea, fi-Prvteohacteria, Synergistia, 
Acidobacteria, 6-Proteohacteria, Erysipelotrichi, Solibacteres, and 
unclassified y4SF7_OZ)7. The October 19, 2011 community had 
50 indicator OTUs, which belong to the classes y-Proteobacteria, 
(i-Proteobacteria, Holophagae, Acidobacteria, Bacteroidia, Flavo- 
bacteria, OL-Proteobacteria, Armatimonadia, Planctomycea, and 
Sphingobacteria. The 52 indicator OTUs for the April 25, 2012 
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Figure 2. Seasonal composition of aeration-basin bacterial community samples, as represented by the number of OTUs. The 

composition is shown at the phylum level, or at the class level for the phylum Proteohacteria. An OTU was counted present if it was detected in at 
least two of three sample replicates. 
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community belong to the classes P-Proteohacteria, y-Proteobac- 
teria, Bacilli, Sphingobacteria, a-Proteobacteria, Clostridia, and s- 
Proteobacteria. 

Taken together, the above results importantly establish a 
baseline for the seasonal variation of aeration-basin community 
composition. 

Laboratory culturing from aeration-basin inoculum 

Flask-culture growth curves are presented in Figs. S3A and B. 
Growth with glucose had approximately a 10 h lag phase, and a 
peak OD500 nm value of 0.5 was reached 2 1 h after inoculation. 
Growth with D-threonine and chloroacetate commenced follow- 
ing a 45 to 75-h lag phase. The standard deviations for cdl the 
replicates are small, except for the 50 mM chloroacetate 
condition, which attained the highest ODgoo (0.8) and 
contained 4.5 times more initial carbon than all other carbon 
sources. For both the [C2mim]Cl and no-carbon conditions, the 
ODgoo 11m values did not change significantiy, suggesting that 
[C2mim]Cl was tolerated but likely not metabolized. These results 
are important because they demonstrate that in addition to 
glucose, D-threonine and chloroacetate (up to 50 mM) can be 
effectively utilized as carbon sources by some of the aeration-basin 
microbes, although D-threonine and chloroacetate were associated 
with significantly longer lag phases than that observed for glucose. 
These results also show that no significant growth was observed in 
the [C2mim]GI flask cultures despite monitoring three replicates 
through 140 h post inoculation. 

Growth curves for the reactor-cultures are shown in Fig. S,3C 
for glucose, D-threonine, and 1 1 and 23 mM chloroacetate. 
Despite irregularities in the curves due to air-bubble entrainment 
in the recycle line, and on some occasions due to adhesion of 
biomass, trends are consistent among replicates and these curves 
show more detail than the flask growth curves derived from 
intermittent sampling. For glucose, a 1 0 h lag period was followed 
by growth, which peaked at 20 h, similar to the flask-cultures. In 
the case of D-threonine, ODgoo nm values declined over the first 
20 h, then increased beginning at 40 to 50 h, peaking at around 
80 h, followed by a sharp decline. The flask culture did not show 
the sharp decline, although the peak occurred at around 70 h. 
1 1 mM chloroacetate growth had two plateaus in succession, 
however the timing between the replicates varied; PhyloChip 
samples were taken after the second plateau. The replicates of the 
23 mM chloroacetate runs were much more consistent than for 
1 1 mM chloroacetate, each showing a 30 h lag, followed by 
growth to a plateau at 40 h. The flask-culture with 23 mM 
chloroacetate had much longer lag and growth phases compared 
to the reactor-cultures. In the no-carbon reactors (data not shown), 
ODgoo nm values dropped by 50% from their initial values in first 
20 to 50 h, and remained constant thereafter. 

Composition of the laboratory-cultured bacterial 
communities 

The number of OTUs in each laboratory culture was reduced 
compared to its inoculum (the preserved and maintained April 25, 
2012 aeration-basin sample; Fig. 3A, Table S2). Interestingly, 
large numbers of OTUs were detected in the [C2mim]Cl, and no- 
carbon cultures despite their w.ry low (x'll mass. This is possibly a 
consequence of the less stringent positive OTU criteria used for 
these low cell mass bacterial communities (see Materials and 
methods). Briefly, for the low ceU mass cultures, three replicates 
were pooled together and analyzed using a single PhyloChip, with 
any detected OTUs considered positives. In contrast, for the 
higher ceU mass cultures, the three replicates were separately 
analyzed and positive OTUs were more stringendy constrained to 



those detected in at least two replicates (Fig. 3A). The smallest 
number of OTUs (458) was detected in the 50 mM chloroacetate- 
cultured community, for which the largest ODgoo nm was 
measured. 

The most abundant phyla in the laboratory-cultured commu- 
nities were Proteobacteria, Bacteroidetes, and Actinobacteria. The 
y-Proteobacteria was greatiy enriched in almost aU laboratory- 
cultured communities (Fig. 3B). While the y-Proteobacteria made 
up only 15% of the total OTUs in the April 25, 2012 field 
community, it represented 45 % of the total OTUs in the 1 1 mM 
chloroacetate, 59% in the 50 mM chloroacetate, 59% in the 
glucose, 46% in the D-threonine, and 53% in the [C2mim]Cl 
flask-cultured communities. In contrast, ^-Proteobacteria was 
reduced in most laboratory-cultured communities. The Clostridia 
and Bacilli of the Firmicutes phylum were significandy reduced in 
all laboratory-cultured communities. The phylum Bacleroideles, 
especially the FlavobacLeria class, was greatly reduced in the 
communities cultured with chloroacetate at high concentrations, 
or with [C2mim]Cl. The Actinobacteria class was detected in all 
laboratory-cultured communities. 

The result shown in Fig. 3 is important because it demonstrates 
that laboratory flasks and reactors-cultured diverse microbial 
communities (although the observed OTU richness was lower than 
that for the sampled aeration-basin community) and specific 
classes were differentially affected by both the culturing method 
and selective media components. 

Similarities among the aeration-basin field communities 
and the laboratory-cultured communities 

Microbial composition similarities among the communities were 
inferred by PCoA, which employed UniFrac distance measure- 
ment (Fig. 4). The aeration-basin communities were differentiated 
from the laboratory-cultured communities along the PI axis, 
which explained the majority of total variation (60.8%). The P2 
(21.0% of total variation) and P3 (6.8% of total variation; Fig. S4) 
axes further differentiated the laborator^'-cultured communities. 

Hierarchical cluster analysis using Bray-Curtis distance matrix 
and group average finkage method supported the PCoA analysis 
result (Fig. 5). The April 25, 2012 field communities and no- 
carbon-cultured communities were grouped together (cluster I) 
and were distinct from the laboratory-cultured communities. 
Cluster II, which consisted of the 1 1 mM and 23 mM chlor- 
oacetate and the D-threonine-cultured communities, was more 
closely located to cluster I containing the April 25, 2012 
communities compared to cluster III, which contained the glucose, 
[C2mim]Cl, and 50 mM chloroacetate-cultured communities. 

Comparison of the microbial richness among the April 
25, 2012 aeration-basin community and the laboratory- 
cultured communities 

A total of 120 families were found in the April 25, 2012 samples 
and the flask-cultured communities (Fig. 6). There were 83 
families in the April 25, 2012 field community, 49 families in 
1 1 mM chloroacetate, 20 families in 50 mM chloroacetate, 3 1 
families in D-threonine, 29 families in glucose, and 56 families in 
[C2mim]Cl cultures. The families detected in the laboratory 
cultures were phylogeneticaUy diverse as shown in the tree in 
Fig. 6. In general, the April 25, 2012 sample contained more 
diverse species and families in most phyla. The following families 
in particular contained more diverse species in the aeration-basin 
community: Lachnospiraceae, Rvminococcaceae , Slreptococcaceae , 
ClosLridiaceae, and EubacLenaceae (belonging to the Firmicutes 
phylum), Rhodocyclaceae (belonging to the p-Proteobacteria class). 
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Figure 3. Number of OTUs and the class composition of the April, 201 2 aeration-basin and laboratory-cultured samples. (A) Numbers 
of OTUs counted present in the preserved and maintained aeration-basin samples and the laboratory-cultured samples, as a function of the counting 
method. (B) Relative OTU richness at the class level. Relative OTU richness was calculated by dividing the number of OTUs (counted if present in at 
least two of three replicates) for a given class by the total number of OTUs in a given sample (shown in parentheses). _AB indicates aeration basin 
sample, _F indicates flask cultures, and _R indicates reactor cultures. An OTU was counted present if it was detected in at least two of three sample 
replicates, except for the [C2mim]CI and No-Carbon flask-culture samples (three replicates were pooled together) and for the No_Carbon_R1 and 
No_Carbon_R2 (data from a single PhyloChip) (see IVIethods and Results). 
doi:1 0.1 371 /journal.pone.01 05689.g003 



Moraxellaceae [y -Proteobacteria), Gemmatimonadaceae [Gemmati- 
monadetes), hosphaeraceae (Planctomycetes), and Sphingohacter- 
iaceae {Bacteroidetes). In contrast, the y-Proteohacteria were more 
diverse in some of the flask-cultured communities than in the 
aeration-basin community: April 25, 2012 community (24 species), 
glucose (34 species), [C2mim]Cl (38 species), 1 1 mM chloroacetate 
(31 species), 50 mM chloroacetate (22 species) and D-threonine 
(16 species). The Actinobacteria phylum contained a comparable 
number of species between the [C2mim]Cl (20 species) and the 
field community (19 species). 

Comparison of the microbial abundance in the 
laboratory-cultured communities 

The relative abundances of the top 50 families are shown in Fig. 
S5. Most families showed stronger microarray intensities in the 
aeration-basin communities than in the laboratory-cultured 



communities. However, Enterobacteriaceae , Aeromonadaceae and 
Succinivibrionaceae belonging to the y-Proteobacteria class were 
more relatively abundant in the glucose and [C2mim]Cl-cultured 
communities. It appears that [C2mim]Cl also enriched Xantho- 
monadaceae (^/-Proteobacteria), Rhizobiaceae {a-Proteobacteria), 
and Micrococcaceae {Actinobacteria). The Pseudomonadaceae (y- 
Proteobacteria) was enriched the most in the 23 mM chloroacetate 
reactor cultures, while Oxalobacteraceae {[^-Proteobacteria) was 
enriched the most in the 23 mM chloroacetate flask cultures. The 
Burkholderiaceae {fS-Prvleobacteria) was most abundant in the 
50 mM chloroacetate cultures. 

Significantly enriched individual OTUs were identified by 
hierarchical clustering using a Bray-Curtis distance matrix and the 
flexible beta linkage method followed by indicator analysis. OTUs 
(minimum INDVAL&66.7, p<0.05) enriched in laboratory- 
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cultured communities are shown together in Fig. 7. The majority 
of enriched microbes were y-Proteohacteria. 

Ninety-three percent of the top 100 enriched OTUs in the 
glucose and [C2mLm]Cl -cultured communities belong to Entero- 
bacteriaceae (the genus Cronohacter , Erwinia, Escherichia, Kleb- 
siella, Pantoea, Raoultella, Salmonella, Serratia, Sodalis, and 
unclassified) and Aeromonadaceae families (cluster 5 and cluster 4, 
Fig. 7). Interestingly, these Enterobacteriacea OTUs appeared to 
be enriched in the no-carbon added flask culture as well, but not in 
the no-carbon added reactor cultures. In contrast, the Aero- 
monadaceae OTUs were not enriched in any of no-carbon added 
cultures. 

Ninety-eight percent of the enriched OTUs by D-threonine 
were Pseudomonadaceae (cluster 3, Fig. 7). Most of them were 
unclassified Pseudomonas species and the rests were classified 
Pseudomonas species [P. fragi, P. fuscovaginae, P. lutea, P. 
mendocina, P. mosselii, P. nitroreducens , P. stutzeri, P. veronii, 
and P. viridiflava), unclassified 5-Proteobacteria, and Flavobacter- 
iaceae. 

In all 1 1 mM chloroacetate and 23 mM chloroacetate flask 
cultures, the majority of the indicator OTUs were Pseudomona- 
daceae {P. fuscovaginae, P. mendocina, P. nitroreducens, P. 
stutzeri, P. veronii and unclassified Pseudomonas sp.) and 
Alcaligenacea [fi-Proteobacleria) (cluster 2, Fig. 7). The rests were 
Oxalobacteraceae [fi-Proteobacteria), unclassified S-Proteobacteria 
and Flavobacteriaceae . 

The Pseudomonadaceae family was also enriched in the 23 mM 
chloroacetate reactor culture. However, no indicator OTU 
belonged to the Alcaligenaceae family, unlike all other chloroa- 
cetate-cultured communities. 

The high concentration (50 mM) of chloroacetate enriched 
various fi-Proteobacteria (the families Burkholderiaceae , Alcaligen- 
aceae, Oxalobacteraceae, and Methylibiaceae), as well as Pseudo- 
monadaceae. The enrichment oi Burkholderiaceae was character- 
istic for this community. The Burkholderiaceae OTU indicators 



consisted genus Burkholderia (18 OTUs) and Ralstonia (2 OTUs). 
Some of the Burkholderia OTUs in one of the replicate cultures 
were extremely enriched showing approximately 10-60 times 
stronger microarray intensities (cluster 1-a, Fig. 7). 

The results shown in Figs. 7 and S5 demonstrate that glucose, 
D-threonine, chloroacetate, and [C2mim]Cl differentially affect 
community composition, a key characteristic of selective media. 
Furthermore, these results show specifically which subsets of the 
community are respectively enriched in the presence of glucose, D- 
threonine, chloroacetate, or [C2mim]Cl, establishing baselines for 
aeration-basin community composition changes following propa- 
gation in laboratory flask or reactor culture. 

Discussion 

We report here our foundational efforts towards the develop- 
ment of a laboratory model of an industrial bioreactor failure 
scenario that introduces genetically engineered microorganisms 
(GEMs) to a downstream municipal wastewater treatment plant 
(MWWTP). Our first step was to sample microbial communities 
from the MWWTP's aeration basin (a potential hot spot for 
horizontal gene transfer) at three seasonal time points to establish a 
baseline for a MWWTP community composition (Fig. 2). Since 
microbial community composition in activated sludge treatment 
systems is dynamic and can be affected by various environmental 
factors including temperature and sources of wastewater influx, it 
is not surprising that we observed moderate variation in 
composition between our July 20, 2011, October 19, 2011, and 
April 25, 2012 aeration-basin samples. Although the number of 
OTUs detected in common across all samples was relatively low, 
the microbial composition at the higher taxa level was very similar 
(Fig. 2). It is noteworthy, though, that the OTUs of the PhyloChip 
G3 are at the very fine taxonomic level (sub-species) and have very 
low sequence divergence among them (an average of 0.5%) [19]. 
The major phyla of the aeration-basin community that we found 
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Figure 5. Hierarchical clustering of the aeration-basin and the laboratory-cultured samples. The clustering was constructed with the 
relative OTU abundance data employing the Bray-Curtis distance measurement and group average linkage methods. _F indicates flask cultures and 
_R indicates reactor cultures. 
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(Proteobacteria, Bacteroidetes, Firmicutes, and Actinobacteria) are 
similar to those found in a previous study of tiie microbial 
community of a Hong Kong wastewater plant that employed 
DNA and cDNA sequence analysis [8] . In contrast to this previous 
study that examined only one seasonal community at the phylum 
level (except selectively examining the microbes involved in 
nitrification at the genus level), our work investigated microbial 
communities at various taxonomic levels in depth and also 
compared the communities between seasons. Our observations 
place weight on the importance of pursuing multi-seasonal 
assessment of GEM survival and gene transfer rates within 
MX'WVTP microbial communities in future research efforts. At the 
same time, the moderate extent of seasonal variation in microbial 
community composition provides a degree of confidence that an 
assessment made at a particular seasonal time point will be 
representative of other time points. 

Having established a baseline for seasonal variation in aeration- 
basin community composition, we then investigated how the 
community changed when propagated in the laboratory. 



Laboratory flask and reactor systems cultured fairly similar yet 
distinct microbial communities (Figs. 3-5, 7, S3-5). While a 
proper bench-scale simulation of an activated sludge treatment 
system requires the use of a bioreactor, the observed similarity 
supports the utility of screening carbon sources and growth 
conditions in flasks before progressing to reactors. While 
laboratory-cultured communities generally contained fewer taxa, 
the major phyla and classes present in the aeration-basin 
communities were also detected in the laboratory-cultured 
communities, which retained reasonably high phylogenetic diver- 
sity. Some OTUs, especially belonging to Firmicutes that was one 
of the major phyla in the MWWTP communities, were much less 
abundant in laboratory cultures than in aeration-basin commu- 
nities. We speculate that differences in oxygen concentrations and 
distributions (and the dynamics thereof) between small-scale 
laboratory flask and reactor cultures and the large-scale wastewa- 
ter treatment plant could explain some of the observed compo- 
sition differences, such as the decreased abundance of anaerobic 
Clostridia in laboratory cultures (Fig. SB). While the mixed liquor 
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is intensively aerated and mbced in tlie wastewater treatment plant 
aeration basin, the passage of the mixed liquor through a non- 
aerated clarifier could transiently support the growth oi Clostridia. 
There is no equivalent passage through a non-aerated compart- 
ment in the laboratory culture system, which is constantly aerated 
to saturation. In contrast, some OTUs were more abundant (or, 
detected only) in laboratory-cultured communities. The various 
laboratory culture conditions may have enriched for microbes 
otherwise in low-abundance in the aeration-basin. 

Beyond assessing the extent to which our laboratory growth 
conditions simulated those of the aeration-basin, we tested the 
effectiveness of chloroacetate and D-threonine carbon sources and 
the microbial growth inhibitor [C2mim]Cl as selective media 
component candidates for future GEM survival and gene transfer 
experiments. We anticipated that these media amendments would 
promote the activity of carbon catabolic or efflux-pump tolerance 
genes, rare in MW'WTP microbial communities, which could 



serve as selective markers in these future studies. Such a future 
study would introduce a GEM (containing a carbon catabolic or 
efflux-pump tolerance selectable marker) to our aeration-basin 
laboratory model under the appropriate selective growth condi- 
tion, and then monitor the survival of the GEM and the horizontal 
transfer and propagation of the portion of its DNA encoding the 
marker. 

Although the 50 mM chloroacetate flask cultures attained the 
highest ODgoo nm (Fig- S3), they contained the smallest number of 
OTUs (485) of all the conditions. This suggests that the diversity of 
microbes that can survive in high concentration of chloroacetate is 
relatively low. The genus Bu7'kholderia, in particular, was enriched 
in the 50 mM chloroacetate flask cultures (Figs. 7, S5). The 
haloacetate dehalogenase (EC.3.8.1.3) from a Burkholderia species 
had been characterized [32] to catabolize chloroacetate to 
glycolate. Glycolate, in turn, can be oxidized to glyoxylate by 
glycolate oxidase and used as a carbon and energy source. The 
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Figure 7. OTUs enriched by chloroacetate, glucose, or D-threonine. IVlicroarray relative intensities for the OTU indicators (rows) are shown 
across samples (columns). The color gradient from blue to red represents increasing microarray intensity. 
doi:10.1371/journal.pone.0105689.g007 



genomic information of fi. gladioli, B. mallei, and B. phytofirrmm 
available at MetaCyc (Stanford Research Institute International 
[httpiZ/metacycorg]) [33] indicates that these three species have 
genes encoding haloacid dehalogenase (EC. 3. 8. 1.2). To this end, 
the unclassified Burkholderia species detected in this study may be 
similarly able to metabolize chloroacetate. The Pseudomonas 
genus was enriched by chloroacetate at all concentrations tested 
(11, 23, and 50 mM; Fig. 7). The haloacid dehalogenase of 
Pseudomonas sp. YL and P. putida had been characterized, and 
an E. coli overexpressing haloacid dehalogenase can metabolize 
chloroacetate as a sole energy source [34—39] . While Pseudomonas 
sp. YL and P. putida were not detected in this study, the 
unclassified Pseudomonas species found in this work may have a 
similar ability to catabolize chloroacetate. The Alcaligenaceae 
family (Achromohacter genus, unclassified species) was also 
enriched by chloroacetate (Fig. 7). Although not detected in this 
work, A. xylosoxidans , a species belonging to Achromohacter, has a 
gene encoding haloacetate dehalogenase. Taken together, these 
three observations support chloroactetate as a selective media 
component candidate, and haloacid dehalogenase as a selective 
marker, for our future GEM survival and gene transfer 
experiments: 1) Burkholderia, Pseudomonas, and Achromohacter 
were all significantly enriched in the chloroacetate-amended 
laboratory cultures; 2) in these same genera, haloacid dehalo- 
genases have previously been identified, heterologously expressed 



and characterized; and 3) haloacid dehalogenase can not only 
detoxify chloroactetate to glycolate, but also enable the metabo- 
lism of chloroacetate as a sole energy source. 

Beyond chloroacetate, many of the Pseudomonas and some of 
the Alcaligenaceae were also enriched by D-threonine. D- 
threonine aldolase catabolizes D-threonine into glycine and 
acetaldehyde, and acetaldehyde feeds into the TCA cycle. The 
D-threonine aldolase of Pseudomonas has not been studied in 
depth, but the enzyme was characterized in Xanthomonadaceae 
{Xanthomonas oryzae) and Alcaligenaceae [Achromohacter xylosox- 
idans), which are closely related to the OTUs detected in the D- 
threonine cultures [40,41]. These observations support D-threo- 
nine as an additional selective media component candidate, and 
D-threonine aldolase as an additional selective marker, for our 
future experiments. 

Few microbes have been shown to grow in the presence of the 
ionic liquid [C2mim]Cl [14]. Although our ODgoo measure- 
ments did not detect any net growth in the presence of [G2mim]Cl 
(Fig. S3B), a high number of OTUs was detected for the 
[C2mim]Cl sample. We suspect that although many microbes 
may have been non-viable, dead, or dormant at the time of 
[C2mim]Cl culture sampling, their respective DNA persisted in 
the medium long enough for PhyloChip detection. To distinguish 
which OTUs are active, in future experiments, expressed rRNA 
could be applied to the PhyloChip and recovery experiments and 
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live/dead staining could be performed. Given the relatively high 
abundance in the [C2mim]Cl sample of the Xanthomonadaceae, 
Rhizobiaceae, Micrococcaceae, Enterobacteriaceae , Aeromonada- 
ceae, and Alcaligmaceae famihes (Figs. S5, 7), it appears that 
microbes belonging to these families may have persisted or 
propagated (under the ODgot) detection limit) better than others in 
the presence [C2mim]Cl. Since an efilux pump for [C2mim]Cl 
tolerance was identified in Enterobacter lignolyticus SCFl and 
cross-validated in E. coli [14], [C2mim]Gl may also prove to be a 
valuable selective media component candidate in our future 
experiments. 

Here, we demonstrated the capacity to culture a microbial 
community using preserved inocula from a municipal wastewater 
treatment plant, and maintain adequate diversity and repeatability 
to test GEM survival and gene transfer rates in laboratory-scale 
model system. Our results show specifically which subsets of the 
community are enriched in the presence of glucose, D-threonine, 
chloroacetate, or [C2mim]Cl, establishing baselines for aeration- 
basin community composition changes following propagation in 
laboratory flask or reactor culture. The observation that microbial 
communities cultured with 11 or 23 mM chloroacetate and D- 
threonine are more similar to the sampled aeration-basin 
communities than those cultured with glucose, [C2mim]Cl, or 
50 mM chloroacetate (Fig. 5), may prove useful when choosing a 
particular selective medium over another. Our analysis supports 
haloacid dehalogenase, D-threonine aldolase, and the [G2mim]Cl 
efilux pump from Enterobacter lignolyticus SCFl as candidate 
selective markers, and chloroactetate, D-threonine, and 
[C2mim]Cl as selective media components, respectiviJy, for our 
future GEM survival and gene transfer experiments. This work 
establishes a methodological foundation for assessing and mitigat- 
ing the risks of future large-scale microbial metabohc engineering 
projects, including those extending beyond the bioreactor. 

Supporting Information 

Figure SI Hierarchical clustering of the seasonal aera- 
tion-basin samples based on the relative OTU abun- 
dances. The Bray-Gurtis distance measurement and group 
average linkage methods were employed for the clustering. 
(TIF) 

Figure S2 Indicator analysis of the seasonal aeration- 
basin sample relative OTU abundances. Microarray 
relative intensities for the top 200 OTU indicators (rows; indicator 
value >40.5, ^<0.05) are shown for each sample replicate 
(columns). The color gradient from blue to red represents 
increasing microarray intensity. 
(TIF) 

Figure S3 Microbial growth curves. (A, B) Flask culture 
growth curves for (A) glucose, D-threonine, chloroacetate carbon 

References 

1. Scfhcr C, LoUicr M, Jrzcqucl K. Cornu JY, Amalric L, ct al. (201.3) 
Decontamination of a polyclilorinatcd biplirnyls-contaminatcd soil by phytore- 
mediation-assistrd bioaugmcntation. Biodcgradation 24: 549—562. 

2. ParnellJJ, Denet VJ, Park J, Tsoi T, TicdjcJM (2010) Environmentally relevant 
parameters afiFecting PCB degradation: carbon source- and growth phase- 
mitigated effects of the expression of the biphenyl pathway and associated genes 
in Burkholdrria xrnovorans LB400. Biodcgradation 21: 147-156. 

3. Wakasa Y, Takaiwa V (201.3) I'hr use of rice seeds to produce human 
pharmaceuticals for oral therapy. BiotechnolJ 8: 113,3—1143. 

4. Seo SW, Yang J, Min BE, Jang S, LimJH, et al. (2013) Synthetic biolog>': tools 
to design microbes for the production of chemicals and fuels. Biotechnology' 
Advances 31: 811-817. 

5. Moe-Behrens GH, Davis R, Hayncs KA (2013) Preparing synthetic biology for 
the world. Front Microbiol 4: 5. 



sources; and (B) [G2mim]Gl and no carbon source. Curves are the 
average of triplicates with error bars representing one standard 
deviation between replicates. (C) Reactor culture growth curves 
(three replicate curves each) for glucose, D-threonine, chloroace- 
tate carbon sources. Optical den.sity was measured continuously by 
means of spectrophotometer in recycle line of reactor, with 
oscillations resulting from air-bubble entrainment in the recycle 
line or other irregularities due to the adhesion of biomass. 
(TIF) 

Figure S4 Principal Coordinate Analysis (PCoA) of the 
aeration-basin and the laboratory-cultured sample 
bacterial communities. Samples are projected onto the first 
(PI) and third (P3) principal coordinate axes. Three sample 
replicates are plotted, except for the [G2mim]Gl and No-Carbon 
laboratory-culture samples, for which the three replicates were 
pooled together (see Methods and Results). 
(TIF) 
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